Visyllable Based Speech Animation

Authors

  • Sumedha Kshirsagar
  • Nadia Magnenat-Thalmann
Abstract

Visemes are the visual counterparts of phonemes. Traditionally, speech animation of 3D synthetic faces involves extracting visemes from input speech and then applying co-articulation rules to generate realistic animation. In this paper, we take a novel approach to speech animation: using visyllables, the visual counterparts of syllables. The approach results in a concatenative visyllable-based speech animation system. The key contributions of this paper lie in two main areas. First, we define a set of visyllable units for spoken English along with the associated phonological rules for valid syllables. Based on these rules, we have implemented a syllabification algorithm that segments a given phoneme stream into syllables and subsequently visyllables. Second, we have recorded a database of visyllables using a facial motion capture system. The recorded visyllable units are post-processed semi-automatically to ensure continuity at the vowel boundaries of the visyllables. We define each visyllable in terms of Facial Movement Parameters (FMPs). The FMPs are obtained from statistical analysis of the facial motion capture data and allow a compact representation of the visyllables. The FMPs also facilitate the formulation of rules for boundary matching and smoothing after concatenating the visyllable units. Ours is the first visyllable-based speech animation system. The proposed technique is easy to implement, effective for real-time as well as non-real-time applications, and results in realistic speech
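
The syllabification step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's phonological rules, so this example substitutes the generic maximal-onset principle: consonants between two vowels are attached to the following syllable as long as they form a legal onset. The ARPAbet-style symbols, the `LEGAL_ONSETS` table, and the function names are assumptions made for illustration only, not the authors' rule set or implementation.

```python
# Hypothetical, simplified vowel inventory (ARPAbet-style symbols).
VOWELS = {"AA", "AE", "AH", "AO", "AW", "AY", "EH", "ER", "EY",
          "IH", "IY", "OW", "OY", "UH", "UW"}

# Illustrative subset of legal English onset clusters. This table is an
# assumption for the sketch, not the rule set defined in the paper.
LEGAL_ONSETS = {
    ("P", "L"), ("P", "R"), ("B", "L"), ("B", "R"), ("T", "R"), ("D", "R"),
    ("K", "L"), ("K", "R"), ("G", "L"), ("G", "R"), ("F", "L"), ("F", "R"),
    ("S", "P"), ("S", "T"), ("S", "K"), ("S", "L"), ("S", "M"), ("S", "N"),
    ("S", "T", "R"), ("S", "P", "R"), ("S", "K", "R"), ("S", "P", "L"),
}


def is_legal_onset(cluster):
    """Return True if the consonant cluster may start a syllable."""
    if len(cluster) <= 1:
        # An empty onset or a single consonant is allowed, except "NG",
        # which does not begin English syllables.
        return tuple(cluster) != ("NG",)
    return tuple(cluster) in LEGAL_ONSETS


def syllabify(phonemes):
    """Split a phoneme list into syllables using the maximal-onset principle."""
    nuclei = [i for i, p in enumerate(phonemes) if p in VOWELS]
    if not nuclei:
        # No vowel: return the whole stream as one unit (or nothing).
        return [list(phonemes)] if phonemes else []

    syllables = []
    start = 0
    for cur, nxt in zip(nuclei, nuclei[1:]):
        cluster = phonemes[cur + 1:nxt]  # consonants between two nuclei
        # Try the longest suffix first; whatever is left stays as the coda.
        coda_len = len(cluster)
        for k in range(len(cluster) + 1):
            if is_legal_onset(cluster[k:]):
                coda_len = k
                break
        boundary = cur + 1 + coda_len
        syllables.append(list(phonemes[start:boundary]))
        start = boundary
    syllables.append(list(phonemes[start:]))
    return syllables


if __name__ == "__main__":
    # "extra" -> EH K | S T R AH
    print(syllabify(["EH", "K", "S", "T", "R", "AH"]))
```

In a concatenative system of the kind the abstract describes, each resulting syllable would then be mapped to a recorded visyllable unit; that mapping depends on the paper's unit definitions and is not sketched here.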

Similar resources

Stylized synthesis of facial speech motions

Stylized synthesis of facial speech motions is central to facial animation. Most synthesis algorithms put emphasis on the reasonable concatenation of captured motion segments. The dynamic modeling of speech units, e.g. visemes and visyllables (the visual appearance of a syllable), has not drawn much attention. In this paper, we address the fundamental issues regarding the stylized dynamic model...

A Speech Driven Face Animation System Based on Machine Learning

Lip synchronization is the key issue in speech driven face animation system. In this paper, some clustering and machine learning methods are combined together to estimate face animation parameters from audio sequences and then apply the learning results to MPEG-4 based speech driven face animation system. Based on a large recorded audio-visual database, an unsupervised cluster algorithm is prop...

The Study of Education Based on Animation in Patient’s Performance under Hemodialysis in Emergency Evacuation Selected Hospitals of Aja

Introduction: A disaster evacuation program is one of the most important parts of hospital crisis management. The following study was carried out to determine the effects of animation-based teaching on hemodialysis patients’ performance in an emergency evacuation. Material and Method: In this quasi-experimental study, two out of four AJA Hospitals in Tehran that had hemodialysis wards, were sel...

Automatic Visual Speech Animation

Visual speech animation, also known as lip synchronization, is the process of matching a speech audio file with the lips’ movements of a synthetic character. Visual speech is a very demanding task, being either fully manual, which is very time consuming, or with automatic methods based on data analysis. Currently, there is still no automatic method that generates any sequence of visual speech, ...

Data-Driven Speech Animation Synthesis Focusing on Realistic Inside of the Mouth

Speech animation synthesis is still a challenging topic in the field of computer graphics. Despite many challenges, representing detailed appearance of inner mouth such as nipping tongue’s tip with teeth and tongue’s back hasn’t been achieved in the resulting animation. To solve this problem, we propose a method of data-driven speech animation synthesis especially when focusing on the inside of...

Journal:
  • Comput. Graph. Forum

Volume 22  Issue 

Pages  -

Publication date: 2003